This disc contains all the data and codes (Stata and Matlab) to produce all 
the results presented in Tables 1-A3 and Figure 1 in the manuscript titled 
"Misclassification between Patent Offices: Evidence from a Matched Sample 
of Patent Applications" by Alfons Palangkaraya, Elizabeth Webster and Paul 
H. Jensen.

The contents of the disc are in put into two folders "MATLAB" and "STATA
each containing respective data files and programming codes.

1. MATLAB folder

The MATLAB folder contains the following m files and the corresponding
data files which are used to produce the estimates presented in Table 3. 
These estimates are then used in the STATA do files to obtain the results 
summarise in all other tables.

These MATLAB programs were run using MATLAB Version 7.0.0 (R14).

1.1 misclassfication.m

This file produces the estimates of probit model with misclassification in 
the dependent variable (Equation 8) using all sample data summarised in 
Table 3. 

This m file uses type12.mat as the data file and misprobll3, rows.m, and 
stdn_cdf.m for likelihood function, row referencing, and approximation of the 
standard Normal cumulative distribution function. The rows.m and stdn_cdf.m 
are parts of the LeSage Econometrics Toolbox (http://www.spatial-econometrics.com/)


1.2 misclassfication_nobiosoft.m

This file produces the estimates of probit model with misclassification in 
the dependent variable (Equation 8) excluding all biotechnology and 
software patent applications from the sample data summarised in the last 
columns of Table 3. 

This m file uses type12.mat as the data file and misprobll3, rows.m, and 
stdn_cdf.m for likelihood function, row referencing, and approximation of the 
standard Normal cumulative distribution function. The rows.m and stdn_cdf.m 
are parts of the LeSage Econometrics Toolbox (http://www.spatial-econometrics.com/)

2. STATA folder 

The STATA folder contains a Stata do file, misclassification.do, and the
corresponding data files to produce the results summarised in Tables 1,
2, 4, 5, A2, and A3 and Figure 1.

